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Abstract 


We calculate the vacuum entanglement entropy of Maxwell theory in a class of curved space- 
times by Kaluza-Klein reduction of the theory onto a two-dimensional base manifold. Using two- 
dimensional duality, we express the geometric entropy of the electromagnetic field as the entropy 
of a tower of scalar fields, constant electric and magnetic fluxes, and a contact term, whose lead¬ 
ing order divergence was discovered by Kabat. The complete contact term takes the form of one 
negative scalar degree of freedom conhned to the entangling surface. We show that the geomet¬ 
ric entropy agrees with a statistical dehnition of entanglement entropy that includes edge modes: 
classical solutions determined by their boundary values on the entangling surface. This resolves 
a longstanding puzzle about the statistical interpretation of the contact term in the entanglement 
entropy. We discuss the implications of this negative term for black hole thermodynamics and the 
renormalization of Newton’s constant. 
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I. INTRODUCTION 


The entanglement entropy of a region of space [l-^ is a quantity with broad applica¬ 
tions including to bL 


correspondence 


ack hole physics condensed matter theory and the AdS/CFT 
l7| . In all of these applications one encounters field theories with gauge 
symmetry, and for gauge theories multiple new subtleties arise that are not present in the 
case of scalar and spinor fields. For minimally coupled scalars and for spinors, the entangle¬ 
ment entropy can be computed by Euclidean methods. For nonminimally coupled scalars, 
gauge fields, and gravitons the Euclidean formula contains a contact term that does not 
have a known interpretation as entanglement entropy js-lO]. Understanding these contact 
terms has been identified as one of the major open problems in the entanglement entropy 
of black holes jd]. The goal of the present paper is to resolve these issues in the context of 
f/(l) Maxwell theory (i.e. compact QED with no charges). 

The geometric entropy of a static spacetime with a bifurcate Killing horizon (such as 
the Schwarzschild black hole, Rindler, or de Sitter) can be calculated by means of a conical 
variation of the Euclidean path integral. In terms of the covariant partition function Z, the 
geometric entropy is given by [n| 


S={l-Pdy)\nZ\0=2., ( 1 ) 

where the variation of the angular period (3 not only changes the temperature, but also 
inserts a conical singularity at the bifurcation surface of the Killing horizon. Formally, this 
is similar to a protocol used to calculate the entanglement entropy, 

F(p) =-tr(plnp), (2) 


of the reduced density matrix p of a region bounded by an entangling surface, where in this 
case the entangling surface is the bifurcation surface. While the term geometric entropy 
has sometimes been used interchangeably with the term entanglement entropy, here we wish 
to draw a distinction between the two quantities, as in general they can be different. If 
the helds couple nontrivially to curvature, the geometric entropy contains a contact term 
due to interaction of the helds with the conical singularity. These contact terms appear for 
nonminimally coupled scalar helds, gauge helds, and gravitons [4, llO|. Such contact terms 
need not have an interpretation in terms of a von Neumann entropy. 
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We will show that in the case of Maxwell theory, the contact term in fact does have a 
statistical interpretation; it is the entanglement entropy of edge modes, which are degrees 
of freedom localized on the entangling surface. 

To see how contact terms arise, a useful illustrative example is that of a nonminimally 


coupled scalar held 
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14| . Consider a scalar held 0 with the Euclidean action 




( 3 ) 


The action contains a direct coupling to curvature, which leads to a contact interaction with 
the conical singularity. The contribution of this interaction to the geometric entropy takes 


the form of a quantum expectation value of Wald’s entropy formula 
action takes the form: 

-S'contact = -27r^ / (0^) 


15 


ITj . which for this 

( 4 ) 


with the integral taken over the entangling surface. This expectation value is divergent and 
can have either sign depending on the parameter ^ of the nonminimal coupling. This term 
cannot be part of the entanglement entropy, as it would lead to the conclusion that the 
entanglement entropy in flat spacetime depends on the value of the nonminimal coupling 
parameter But the entanglement entropy should only be a function of the state p, and the 
vacuum wavefunction in flat spacetime is independent of In the case of the nonminimally 
coupled scalar, the contact term can be understood as an additional contribution to the 
generalized entropy that must be included even classically in order to obtain a quantity 
obeying the generalized second law ISj. This gives a consistent picture of the contact 
term for nonminimally coupled scalars, albeit one without a statistical interpretation at low 
energies. However, such terms may still arise from a high energy theory in which all entropy 
is statistical 141 ]. 

The geo^etne enteopy of Maxwell theoey was h^st ea.co.ated by Kabat B- He foood 

that the partition function of Maxwell theory also has a contact term which can be traced 
to the nonminimal coupling in the spin-1 Laplacian Ai = —-|- Rat- This contact term 
contributes negatively to the entropy, leading to an overall negative sign of the leading order 
divergence for D < 8. The meaning of the contact term of Maxwell theory has remained 


obscure, and t 


physical 


19|-l26| 


lere has been much disagreement as to whether it should be regarded as 


4 

















While it may be tempting to also interpret the Maxwell contact term as a Wald en 


tropy, this interpretation is untenable for several reasons 


23| . First, the coefficient of the 


nonminimal coupling in Maxwell theory is hxed; thus one cannot rule out an entanglement 
interpretation by comparing different nonminimal couplings, as was done in the case of a 
scalar held. Moreover, if one repeats the argument leading to (jl]) one arrives at the integral 
of the gauge-dependent expression where ^ Ms the projection of the gauge poten¬ 

tial onto to the normal plane of the entangling surfacqj. This would-be contact term in fact 
represents an ambiguity in the dehnition of Wald’s entropy formula la, l27| when generalized 
to the situation of huctuating quantum helds. This ambiguity was recently resolved for the 
case of classical higher derivative gravity 


28 


29l | and when ap plie c 


Maxwell theory, this rehnement of Wald’s formula gives zero 


26 


to the case of classical 


30| . Finally, unlike the 


nonminimally coupled scalar, there is no need to add an additional term to the generalized 
entropy: since Maxwell helds satisfy the null energy condition, the classical second law is al¬ 
ready satished. We take this as evidence that the divergent contact term in Maxwell theory 
cannot be interpreted as a quantum version of Wald entropy. Hence some other explanation 
for the contact term divergence is needed. 

As an alternative to the Euclidean path integral, one can calculate entanglement entropy 


using a physical regulator, such as a lattice |3ll-l35|. In Hamiltonian lattice gauge theory 


(without matter), the conhguration space degrees of freedom are integrals of the gauge held 
along links of a spatial lattice. The space of physical states is not the full tensor product 
of link Hilbert spaces, it is a quotient of this space by gauge transformations. This physical 
Hilbert space thus does not admit a canonical factorization according to regions of space. 
One approach {m, 321 is to embed the physical Hilbert space into a tensor product of local 
Hilbert spaces. The local Hilbert spaces include edge mode degrees of freedom living on the 
boundary which arise due to the Gauss constraint, that give a positive contribution to the 
entropy. □ In the case of the toric code, this dehnition reproduces the well-known value for 
the universal subleading term in the entanglement entropy (the topological entanglement 
entropy 36|, 1^) which persists in the continuum limit. In this case the entire entropy, 
^ In the family of’t Hooft gauges, the regulators can be adjusted to make the result independent of the 


gauge parameter ^ 


24j . This suggests that the contact term may contain some universal gauge-invariant 


information, and indeed we will show that this is the case. However the expression in terms of fluctuations 


of A±_ does nothing to establish its meaning as a gauge-invariant statistical entropy. 

^ This definition is closely related to the “electric” definition of entanglement entropy in Ref. 


^1 although 


there are some differences in topologically nontri^^al 


regions. 

































including its universal piece, conies from the sum over edge modes. So the edge modes are 
essential for obtaining the universal terms in the continuum entanglement entropy, and we 
will see that the same is true for Maxwell theory. 


In Ref. 


23| . the contact term was studied with a focus on the case of two-dimensional 


spacetimes, particularly those such as two-dimensional de Sitter which are compact after 
Wick rotation. There it was found that once the topological sector of the theory is treated 
correctly, the geometric entropy is equal to the entanglement entropy. However two dimen¬ 
sions is a rather special case, since two-dimensional Maxwell theory has only global degrees 
of freedom. 

The goal of the present paper, which expands on the arguments of Ref. 


extend the analysis of Ref. 


(cf. |39|]), is to 


23| to spacetime dimension D > 2, using a continuum analogue 
of the lattice entropy dehned in Refs, {m, 32|. We will use a result for the partition function 
of Maxwell theory that properly takes into account the effects of the compact gauge group 
40(] , which is reviewed in section [TTl 

We consider product manifolds of the form Bx where H is a two-dimensional manifold 
with a bifurcate Killing horizon and a compact Euclidean section (the base), and is any 
compact manifold (the fiber). For example, we can consider a geometry in which one spatial 
dimension is exponentially expanding to the past or future, while the other dimensions stay 
a fixed size. This is represented by a geometry dS 2 x J^, which Wick rotates to x J^. 

We then treat the contact term by Kaluza-Klein reducing onto B in section IIIII The 
U(l) Maxwell theory on the manifold B x T reduces to multiple U(l) Maxwell theories on 
B (representing electric and magnetic fluxes), together with a number of periodic massless 
scalar fields and towers of massive scalar and vector fields. The advantage of this reduction 
is that we can dualize all vector degrees of freedom on B to scalar degrees of freedom. This 
leaves us with towers of fields for which the geometric entropy agrees with the entanglement 
entropy; what remains is the contact term. This contact term takes the form of a negative 
scalar field confined to the entangling surface. Its leading divergence agrees with the result 
of Ref. js], but we also establish the existence of subleading and finite terms, some of which 
are universal, i.e. independent of the regulator scheme. 

In order to give a statistical interpretation to the contact term, we consider regulating the 
conical singularity by introducing a “brick wall” ^ at a short distance e from the entangling 


surface. When standard boundary conditions are fixed at the brick wall, the geometric 
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entropy formula has a statistical interpretation as the entropy of a thermal ensemble with 
hxed boundary conditions. However, the brick wall does not capture the correct physics 
of the entangling surface, which does not obey any boundary conditions. In section IIVI 
we discuss how the partition function changes under the introduction of a brick wall. For 
magnetic conductor boundary conditions, the partition function is the same as if there was 
no brick wall, except for a small correction coming from exchanging Dirichlet with Neumann 
boundary conditions on some of the scalar helds (calculated in section |V]), and an edge mode 
contribution. 

In section IVII we explain more carefully the origin of the edge modes, and calculate their 
partition function. This allows us to conhrm that the geometric entropy agrees with the 
statistical entropy. In particular, the contact term captures the entanglement entropy of the 
edge modes. 

In section IVIII we consider the case of four-dimensional spacetime. Four dimensions 
is special because Maxwell theory is conformal, and so the logarithmic divergence of the 
entanglement entropy is universal and should be related to the trace anomaly 4^, l43|]. But 


the trace anomaly result was found to be in conflict with the entanglement entropy calculated 


by thermodynamic methods 


25 




44j |. We resolve this puzzle by showing that when the 


edge modes are included, the entanglement entropy agrees with the trace anomaly. We 
comment on the implications for the holographic entanglement entropy at strong coupling, 
which we argue must already contain an edge mode contribution from the strongly coupled 
Yang-Mills theory. 

In the Discussion, we explain why the leading order contribution found in Ref. js] is 
negative, and explain how the sign of the leading order term depends on the choice of cutoff. 
We also discuss possible extensions of our work to nonabelian gauge helds and gravitons, 
or to entangling surfaces without Killing symmetry. Finally we discuss the implications for 
black hole physics and the renormalization of Newton’s constant. 


II. MAXWELL THEORY 

To prepare for Kaluza-Klein reduction, we hrst consider the partition function of Maxwell 
theory with gauge group U(l) on a compact Euclidean manifold M. 

Locally the electromagnetic held can be represented as a 1-form A up to local gauge 
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transformations Aa ^ Aa + VaO where a is a scalar. The Euclidean action I is expressed 
in terms of the electromagnetic field tensor Fab = "VaAb — VbAa as 



(5) 


The partition function is then given formally by the Euclidean path integral 



( 6 ) 


There are global issues that arise from the U{1) nature of the gauge held: 

First, we must identify any two 1-forms A and A! such that around every closed curve 7 



(7) 


This requirement ensures that a particle whose charge is a multiple of q cannot distinguish 
A from A' when transported around a noncontractible curve. Equivalently, one can allow 
for the parameter a of the gauge transformation to be identihed under a ^ a + 27r/q when 
going around a noncontractible curve. These are the large gauge transformations. 

Second, we must include held strength tensors F that can be expressed as F = dA locally, 
but globally require gluing multiple A helds together with the requirement that f A agrees 
up to a multiple of 271/q where the patches overlap. This leads to the Dirac quantization 
condition, §^F G 277 jq for every closed 2-surface D. The set of such held conhgurations is 
discrete, so they are summed over in the path integral. 

Though for many purposes one does not need to distinguish between the gauge groups 
U{1) and M, here the distinction is fundamentally important. The reason is that while the 
nonzero modes of a gauge held act like harmonic oscillators, the zero modes of the M gauge 
theory act like free particles, and do not have a normalizable ground state. Since we are 
calculating the ground state entanglement entropy, a noncompact gauge group would lead 
to an infrared divergence in the entanglement entropy. Attempting to cure this divergence 
by the introduction of a mass breaks gauge invariance and leads to further problems. This 
problem is naturally cured in the U{1) gauge theory, since the zero modes are quantum 
mechanical free particles on a circle, which again have normalizable ground states. 

In Ref. 0. .he pa^thion funcon of Maxwell fheoay waa calculated tu the Euchdeau 
path integral by covariant gauge-hxing. The result can be expressed as a product of terms: 

First, there are functional determinants that arise from the Gaussian path integral over 
the nonzero modes of the vector potential and the Faddeev-Popov ghosts. In the covariant 



formalism these consist of modes of the transverse vector Laplacian Af and of the scalar 
Laplacian Aq. The longitudinal modes depend on the ‘t Hooft parameter and on a mass 
scale p appearing in the path integral measure, which we have allowed to take different values 
for the vector held and ghosts (pa). The nonzero modes contribute to the partition 
function a factor 


/ A^ \ 


- 1/2 


/ / 


•^nonzero modes dct I q ) dct I 9 /- ) 

V27r/iV V2vr/iiey 

The prime denotes that det ' is the product only over nonzero eigenvalues. 


ha 


( 8 ) 


Second, there is a factor coming from the hat connections. Since the action vanishes for 
these conhgurations, they contribute a factor of the volume of their moduli space. This 
volume is made hnite by the quotient by large gauge transformations. Let wj = wiadx°', 
/ = 1,..., 6i be a topological basis of 1-forms in Z), whose dimension is the Betti 

number bi. Any harmonic 1-form whose integrals around all closed curves are integers can 
be written uniquely as an integer linear combination of the wj. In this basis, the space 
of hat connections modulo large gauge transformations is the torus obtained by identifying 
opposite edges of the cube [0,1]^L The standard norm on vector helds pulled back to this 
space dehnes the metric on moduli space as 


r/,/= [ y/gg''''WiaWji,. (9) 

Jm 

The contribution of hat connections to the path integral is simply the volume of moduli 
space in the functional measure, and is given by 


connections 


= det 


2vr/iA 


1 2 


1/2 


( 10 ) 


Third, there is a factor associated with the constant gauge transformations. Since these 
gauge transformations do not modify A they must be treated specially; we still must divide by 
the volume of the gauge group, but the zero mode cannot be gauge hxed as is conventionally 
done for the higher modes. The result is a global factor that depends on the volume V of 
the spacetime manifold: 


■^constant gauge transformation 2 

/^o 


27iV 


( 11 ) 


We note that this term is often absent from discussions of the path integral of gauge theories. 


but its presence is essential for agreement with the canonical formalism as shown in 


40|. 
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Finally, there is a factor associated to the nontrivial bundles. These are classihed by 
the (discrete) second homology group Z), which consists of harmonic 2-forms whose 

integrals over all closed 2-surfaces are integers. Their contribution to the partition function 
is 


7 - 

•^nontrivial bundles / ^ ^ 

Fe^H'2{M,Z) 

Putting all of these factors ([8]) flTOl) ffTTl) flT^ together, the result is 

2 \ / at \ -1/2 / A \ -1/2 


( 12 ) 


Z = ^A/^det 


V 27 rt- 


27r/XA 

Q 


f AJ' 

det (-^ det 

V2vr/xy 


( det 


Ar 


E 




(13) 

We can simplify this formula by rescaling the functional determinants using zeta function 
regularization: 

det'(A) :=e-d(A,o)^ ^^4) 


where ({A,s) is the zeta function of the operator A, (C(A,s) := tr'(A“^), and tr' denotes 
omitting the zero modes of A. This formula dehnes the zeta function for Re(s) > D/2] it 
is then dehned for other values of s by analytic continuation. We can see directly from the 
dehnition that the functional determinant scales as 


det'(aA) = a^*^^’°Met'(A) (15) 

so that C(A, 0) can be thought of as a regularized number of nonzero modes. We can then 
apply the following result, that for any elliptic differential operator, 

C(A, 0) = — dimker A-|-A (16) 


where A is an anomaly that appears only when D is even and takes the form of the integral 
of a local geometric quantity. The anomaly can be cancelled by a local counterterm; we can 
therefore ignore a hnite shift in this term. The factors of ha, and ^ from scaling the 
determinants cancel with the other terms in Eq. (IT^ . 

Thus upon rescaling the functional determinants, we see that the partition function does 
not depend on the measure factors /i or the gauge parameter ^ (as should be the case on 
physical grounds): 


Z 


27 rVo\{M) 


det 



det^(Ao) 
det '(Ai)^A 







(17) 


-S[F] 
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By scaling out the factor of ^ from the longitudinal determinant we have effectively chosen 
Feynman gauge (.^ = 1), and this has allowed us to combine the longitudinal and transverse 
components of the vector held into a single determinant. 


III. KALUZA-KLEIN REDUCTION 

We now consider Kaluza-Klein reduction of Maxwell theory on a manifold of the product 
form A4. = B X . Here H is a two-dimensional base manifold and is a H — 2 dimensional 
compact hber: B will contain the directions normal to the entangling surface, and the 
directions along the entangling surface. We will carry out the reduction at the level of 
the partition function in order to keep off-shell effects to which the entanglement entropy is 
sensitive. The purpose is to divide the Maxwell partition function into a factor for which the 
geometric entropy formula ([T]) agrees with the entanglement entropy of the on-shell degrees 
of freedom, and another portion that we identify as the contact term. 

We will show that the Maxwell partition function can be written as a product of partition 
functions 

Z = Zscalars^E^B^x (1®) 

where ^scalars is a tower of scalar helds on B] Ze and Zb are two-dimensional Maxwell helds 
on the base corresponding to electric and magnetic fluxes respectively. For these first three 
contributions, the geometric entropy is equal to the entanglement entropy. The remaining 
factor is the contact term, whose interpretation will be the subject of section IVIl 

A. Unpacking the partition function 

We first consider the functional determinant piece of the partition function ([8]). On a 
product manifold BxT we can split the vector determinant into a contribution from vectors 
polarized along the base B, and a contribution from vectors polarized along the fiber This 
is expressed in the identity 

det'(Ai) = det'(Af ©A^)det'(A®©Af). (19) 

Viewed from the base manifold, the vector field breaks into a Kaluza-Klein tower of vector 
fields whose masses are given by the spectrum of the scalar Laplacian on the fiber, G 
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spec(Aj^), and a tower of scalar fields whose masses are given by the spectrum of the vector 
Laplacian on the hber, G spec(A^). This functional determinant, together with the 
scalar functional determinant of the ghosts, encodes all the {D — 2) local bosonic degrees 
of freedom of the D-dimensional Maxwell held. We now turn to the remaining parts of the 
partition function that describe the topological sector. 

We can also decompose the moduli space of hat connections into hber and base polariza¬ 
tions. Letting denote the metric on the space of hat connections on Wl, we see that 
the metric on the product splits as = Vol(D)r*^^) © Vol(D)r('^), so that: 

v—?—) ■ * ' 

Here we have dehned = q/ ^Vol(D), the fundamental charge of the two-dimensional 
Maxwell theory on B. 

We can now express the prefactor from the gauge zero modes flTTD in terms of gg as 




27rVol(Wl) V27rVol(H)’ 


( 21 ) 


which we recognize as the gauge zero mode term for two-dimensional Maxwell theory on B, 
with fundamental charge q^- 

Since the bundles correspond to harmonic two-forms, they can be divided into three 
types depending on which two directions the two-form point: along the base, along the 
hber, or both. The harmonic two-forms on the two-dimensional base B can be expressed as 
Fab = f^abi where / is constant and eab is the volume form on B. They are quantized so 
that = Vol{B)f G Their contribution to the partition function is: 


E 


E 


( 22 ) 






where we have dehned the rescaled held tensor F' = ^yVo\{F)F. In this form, we see that 
it is equal to the sum over the nontrivial bundles of Maxwell theory on B with fundamental 
charge qs- 

The bundles pointing along the hber can be expressed similarly as 

^ e-lV„i(B)/,f> (23) 

fg^h^{f,i,) 
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These correspond to magnetic fields in the fiber directions that are constant along the base. 

The mixed bundles that point in both base and fiber directions can be expressed in terms 
of a basis wf of Z) and a basis Wj of Z). A general mixed element of Z) 

then takes the form 


F = 


{ 2ti\ „ j 

( — 1 mijWj A Wj 


(24) 


where mu is a matrix of integers. The contribution of these mixed bundles to the partition 
function is 

1 


exp 


mij 


^ ^ IJKL 


(25) 


Using these identities, we can express the original partition function of Maxwell theory 
flT7|) as a product of field theories defined on B. However Eq. flT^ contains a tower of vector 
fields, each of which has a contact term. In order to isolate this contact term, we will first 
trade these vector degrees of freedom for scalars. 


B. Proca-scalar duality 

The Kaluza-Klein description of Maxwell theory includes a tower of vector fields on the 
base manifold B. Since the vector Laplacian contains an effective nonminimal coupling to 
background curvature, the vector fields will include a contact term in addition to the contri¬ 
bution from their on-shell degrees of freedom. In order to disentangle these two contributions 
we make use of massive p-form duality to relate the massive vector fields to dual scalar fields 

*/*dual • 

We perform a Hodge decomposition of the operator Af, expressing the vector field Aa 
as an orthogonal sum of exact, co-exact and harmonic vector fields. In two-dimensions this 
takes the form Aa = Va0 + + Ba-, where 0 and ijj are scalars and Ba is a harmonic 

vector field. In terms of spectra, this says that the spectrum of the vector Laplacian Af is 
two copies of the spectrum of the scalar Laplacian Af, up to zero modes. This leads to the 
functional determinant identity for m > 0: 

det(Af -I- m^) = det(Af -|- (26) 

The Euler characteristic x{B) comes from the difference between the number of zero modes 
of a vector and two scalars: a vector has bi zero modes, while a scalar has zero modes. 
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and 2bo — bi=x- In the massless sector, we do not include the zero modes in the functional 
determinant, so there is no Euler number correction and we have simply 

det'(Af) = det(Ag )^. (27) 

When we take the product of Eq. fl2B]) over the spectrum of Kaluza-Klein masses, we 
obtain the identity 


det '(Af © A^) = det '(A® © A^)^ det (28) 

The hrst factor describes two scalar helds on x when we apply this identity to the 
Maxwell partition function it will cancel with the two Faddeev-Popov ghosts. The remaining 
term takes the form of —x{B) scalar helds on We will see that this gives the contribution 
of the nonzero modes to the contact term. 

This relation between functional determinants can be understood in more physical terms 
via the 2D duality between the massive vector (Proca) held and a massive scalar held. Recall 
that the Proca action for a massive vector held is 

/ = + (29) 

where the mass term breaks the gauge symmetry 6Aa = VaCr of the massless vector held. 
However, it is possible to restore the gauge symmetry by adding an additional scalar held g 
which transforms as 6g = ma, so that the combination Vag — ^Aa is gauge invariant. One 
can then write the action in the equivalent Stueckelberg form: 

I = j \Fab^ + \{^ag-mAaf, (30) 

where the equivalence to the Proca form can be shown to hold (even oh-shell) by gauge-hxing 
so that (? = 0. 

When we KK reduce the Maxwell held, the tower of massive vector helds naturally appear 
in this Stueckelberg form. Aa is proportional to the vector held polarized on the base, while 
the Stueckelberg mode g is proportional to the corresponding longitudinal mode on the 
hber. These two modes are related by a gauge symmetry, which comes from reducing the 
higher dimensional gauge symmetry. In Feynman gauge, Aa, g, and the ghosts all propagate 
independently, so each massive vector held has +1 degree of freedom, just like a scalar held. 
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On-shell, the Proca held is dual to a scalar held 0 via the duality 

F,be“V2 = mcj). (31) 

Although this duality does not make sense as a substitution into the action, it preserves the 
Hamiltonian and the equations of motion. 

This on-shell duality explains why the partition function of these modes is equivalent to 
a massive scalar, up to the contact term 

det'(A^)^s/l (32) 

This term is an oh-shell ehect, so the duality does not know about it. Each Proca held 
contributes a factor of m^, coming from the diherence in the number of zero modes between 
the Proca and scalar held. 


C. The contact term 


Using the decomposition of the partition function IIIIAl and the Proca-scalar duality Hi I Bl 
we will now rewrite the Maxwell partition function in a way that isolates the contact term. 

After applying the Proca-scalar duality all the local degrees of freedom of Maxwell theory 
are expressed in terms of vector helds polarized along the hber directions: 

det'(A^© Af)-i/l ^33) 

The factor (l33|) includes both massive and massless scalar helds on the base. Some of the 
massive scalar helds, (/)duai, came from dualizing Proca helds—these correspond to the modes 
of Af which come from diherentiating A^ modes. The rest of the scalar modes (which we 
shall call (/>kk) come from direct KK reduction. For each positive eigenvalue of Af, Eq. fl55]) 
gives the partition function of a massive scalar held on B. 

Among the 0 kk helds, there is one massless scalar held on the base for every vector zero 
mode on the hber; for these Eq. fl33|) gives only the contribution from the nonzero modes. 
These massless modes are in fact periodic scalars; their full partition function consists of 
the zero modes of the scalar determinant fl5HD . the right factor of fl2UD . and the bundle sum 
). Combining these factors we hnd the partition function of a linear a-model: 




1/2 


,2 ■) E “p 


27rY 
Q ) 




I,J,K,L 


(34) 
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The target space is, up to a prefactor, the metric on the space of flat connections on 


—) r^. 
Q J 


(35) 


The functional determinant in flMj) comes from the nonzero modes, the second factor is the 
integral over the (compact) zero mode, and the sum is over winding sectors. 

Next we consider the massless vector modes on B. Combining the moduli space of the 
base (1201) . the volume of the gauge zero mode fl2T]) and the bundles wrapping the base 
fl^2]) . we obtain two-dimensional Maxwell theory on B times a contact term. After Poisson 
summation, the partition function of the constant electric field on B is 


Ze= 


g-ivoi{n)i?2 


(36) 


while the two-dimensional contact term is given by 

X{B) 


Qb Y 


(37) 


When we vary the conical angle /S, the volume of B is proportional to f3, and so the first 
factor (136|) takes the form of a canonical partition function. The energy levels are precisely 
those of the quantized electric field E on the base, for which the geometric entropy ([T]) gives 
the entanglement entropy 231. 

The bnndles polarized along the hber (12^ describe qnantized magnetic helds wrapping 
the fiber directions. This contribution is already expressed as a canonical partition function, 
similar to (l36l) except that their values are quantized on the lattice Therefore 

there is no contact term coming from these “magnetic” two-dimensional Maxwell fields. 

With all the local and topological degrees of freedom accounted for, we are left with the 
contact term that is the product of fl32l) and fl371) : 

-X{B) 


Zx = 


\/2vrVj- 


q 


det '(A 


J-5-1/2 
0 ) 


(38) 


The geometry of the entangling surface consists of x(i3) copies of E, so that is the 
partition function of a scalar held localized on the entangling surface. However, the sign 
in the exponent is opposite that of an ordinary bosonic scalar held. The leading order 
divergence in the contact term agrees with the expression fonnd in 8| , which leads to negative 
entropy when regulated by heat kernel methods. 
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IV. INTERPRETING THE CONTACT TERM 


Given the Kaluza-Klein reduction of Maxwell theory it is now straightforward to cal¬ 
culate the geometric entropy via the conical variation ([T]). Since the partition function is 
expressed as a product, it is sufficient to calculate the entropy associated to each set of 
modes separately. We can then ask whether each individual factor can be given a statistical 
interpretation. The local degrees of freedom of Maxwell theory appear after Kaluza-Klein 
reduction as a tower of free minimally coupled scalar fields. Because the scalars are min¬ 
imally coupled, the geometric entropy yields exactly the entanglement entropy, which is 
well-known for free scalar theories (see e.g. 45|b 

Since the contact term is independent of f3, its contribution to the entanglement entropy 
is just = InZ^. The leading order area-law contribution is like the cosmological constant 
induced by —1 scalar fields living on the entangling surface. In a heat kernel regulator, the 
negative entropy in fact overwhelms the positive sources of entropy in dimensions D < 8. 
Thus the ghost scalar helds in the contact term js] render the total entropy negative, and 
have no obvious statistical interpretation in terms of the actual positive degrees of freedom. 

One might think that this negativity is of little consequence since (in D > 2) the area 
law divergence is a power-law divergence, and the coefficient of power law divergences are 
nonuniversal. However, the KK reduction makes it clear that In also contributes to the 
logarithmic divergence (in even dimensions) and to the nonlocal hnite piece of the geometric 
entropy S. This is clear from the fact that it is proportional to the effective action of a D — 2 
dimensional scalar field. Thus the problem is not an artifact of the renormalization scheme. 


30 


and cannot be safely removed from the partition function without consequencel 

While subleading terms of divergent quantities can be negative, the negativity of the 
Kabat term was only a symptom of a deeper concern: is there a statistical mechanical 


interpretation for these extra terms? Using the same methods as in 38|, we will show that 
the answer is yes. 

In section IVII we will calculate the partition function in a new way, which makes the 
statistical origin of this contact term clear. It turns out to be related to the phenomenon 
of “edge modes”, new degrees of freedom that appear when restricting a gauge theory to a 
region with boundary. The contribution of the electric and magnetic 2D Maxwell fields have 
^ Among other things, this would make Z no longer invariant under exchanging the roles of B and a 2D 
factor manifold of tF. 
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a state-counting interpretation related to edge modes, as elucidated in [2^,133,14^. We wish 
to show that the contact term Zy_ can also be given a statistical interpretation in terms of 
these edge modes. 

To do this, we will regulate the theory using an t’Hooft brick wall at a proper distance e 
just outside the entangling surface. To so so we have to choose boundary conditions at the 
brick wall, and the key criterion is that they must not affect the physics far from the wall. 
However, neither of the standard boundary conditions for Maxwell helds have this property. 


If we were to impose electric conducting boundary conditions 


fl 


a = 0, Ai = 0, VrAr — KAr = 0, (r = e) (39) 

(where i points along the brick wall and r is the proper radial distance coordinate, and 
K = KijQ^^ is the trace of the extrinsic curvature), we would hnd that there can be no 
magnetic flux Fij through the entangling surface. On the other hand, if we imposed magnetic 
conducting boundary conditions!^ 

Vr<y = 0, Ar = 0, VrAi — KijA^ = 0, (r = e) (40) 

then the held strength satishes 


Fir = 0, drFij = 0, (r = e) (41) 

and so we would hnd that there is no electric hux Ej_ = Frr (where f is the unit angular 

direction around the brick wall in an orthonormal coordinate system). But in reality the 

entangling surface is not a physical barrier, so both kinds of hux are allowed. Thus neither 

of these boundary conditions are acceptable. 

Our solution will be to impose the magnetic boundary conditions with an arbitrary choice 
of E±, by replacing the last equation of fl40|) with 

VrAi-K,,A^ =f,E^{F), (r = 6). (42) 

Then we will compensate by doing an explicit path integral over all possible choices of E±. 
This allows for both electric and magnetic huxes through the entangling surface. We will 
dehne Zbuik as the partition function of the bulk region outside the magnetically conducting 

^ also known as “relative” boundary conditions 
® also known as “absolute” boundary conditions 
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brick wall (with E± = 0), and Z^dge as the correction coming from the path integral over 
the edge modes; thus the total partition function with the wall is 


Z -^bulk-^edge- 


(43) 


Below, we will prove that (H3|) agrees with the partition function with no brick wall, even 
though x{^) = 0 for fhe brick wall system (since x(«S'i) = 0) so that the contact contribution 
of fl38p does not contribute. Nevertheless the contact term still arises in a different way, from 
the edge mode contribution. 

It is almost (but not quite) true that the edge modes .^edge give rise to the contact term 
Z^ ll38ll together with the sum over the constant mode of the electric flux Ze ([36]) • There is 
also an additional term Ze/n coming from the difference between Neumann and Dirichlet 
boundary conditions for scalars on the brick wall. 

In the case of a scalar held it is best to impose Neumann boundary conditions, because 
this changes the held as little as possible far from the entangling surface. In the limit e —?■ 0, 
there is no ehect on the partition function of a scalar held other than through local power 
law divergences, which are not universal 471. 

On the other hand, for massive scalar helds the Dirichlet boundary conditions have some 
additional subtleties, including UV divergences of the form Inlne”^ in the entanglement 
entropy This means that the entanglement entropy of the Dirichlet scalar does not 


quite correspond to the geometric entropy without any brick wall. 

In the case of the Maxwell held coupled to a magnetically conducting brick wall, the 
tower of scalar helds obtained by KK reduction have Neumann boundary conditions, as can 
be seen from Eq. (14T1) and the fact that 0 kk is proportional to the magnetic held Fij (or to 
Ai in the massless case.) 

But the tower of scalar helds dual to the Proca helds have Dirichlet boundary condi¬ 
tions. This can be seen by substituting the Proca-scalar duality relation = m^duai 

fl3ip into Eq. fl4ip . where “^Fah is the KK reduced 2-dimensional held strength, which is 
proportional to the D-dimensional held strength Fab polarized along the base B. 

If all of the scalars had Neumann boundary conditions, then because they form a complete 
set of modes of Af, the ehects of imposing the brick wall would add up to a contribution 
which is local along the hber F. This means that they could be absorbed into nonuniversal 
local counterterm on the brick wall. But in fact, some of the scalar helds have Dirichlet 
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boundary conditions, and we must take this into account. 

Let us define as the ratio of the partition function for the 0duai modes with Dirichlet 
boundary conditions, compared to the partition function of these same 0duai modes but with 
Neumann boundary conditions. Then the partition function in the presence of a magnetically 
conducting brick wall is 

•^bulk ^B^scalsuTS^ D/N1 

where the ^scalars term refers to the full tower of all scalar fields ^kk, 0duai with Neumann 
boundary conditions, which is equivalent to no brick wall. 

In section |Vl] we will show that 

ZedgeZn/N = Z^Ze (45) 

which implies that 

Z = ZgQg^igj-gZEZ eZ^. = .^bulk-^edge (46) 

where the first expression for Z is the geometric partition function with no brick wall, and 
the second is the brick wall plus edge modes. Thus we will find an exact agreement between 
the two partition functions. Since the corresponding entropies S'buik and S'edge both have a 
statistical interpretation, the entire entropy has a statistical explanation. 

Thus we have explained Kabat’s contact term in terms of the statistical mechanics of 
edge modes, without needing to appeal to the negative entropy ghosts. The reason why 
Kabat obtained a negative leading contribution to the entropy will be discussed in section 

NM 

V. DIRICHLET VERSUS NEUMANN 

In this section we calculate Ze/n, which is the ratio of the Dirichlet and Neumann 
partition functions for the dual scalar modes, which are massive fields on the base B. Recall 
that there is one dual scalar mode for every nonzero scalar mode on the fiber T. 

Let us consider the partition function Z for the manifold where Bjs is the conical 
manifold with angle (3 going around the entangling surface, used to calculate the geometric 
entropy for some particular field of mass m in ([T]). As we zoom in on the entangling surface 
in the base B, it is approximated by a cone with a small disk of radius e cut out of the tip, 
on which we put either Dirichlet or Neumann boundary conditions. 
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Consider radial evolution outward from the brick wall in the coordinate p = Inr. We 
can most easily analyse this problem by doing an exponential conformal transformation 
with Weyl scaling = 1/r = e~^ in order to transform the plane into the (r, p) coordinate 
system, with the Cartesian metric = dp’^+djS'^. The angular coordinate r G [0, f3) remains 
periodic, while p G [Ine. Ini?), where R is the characteristic length scale of the manifold B 
at which the flat approximation is no longer valid; for example, if S is a sphere R ~ rgphere- 
In 2 dimensions, the propagator of a minimally coupled scalar held is conformally invariant, 
while the mass term is not. Therefore the mass term becomes position dependent: 

/[0] = j drdp^ [(Va0)^ + , (47) 

and we can ignore the mass term as p ^ —oo. 

The mass and/or the curved geometry provides a somewhat fuzzy cutoff on one side of 
the cylinder, but the precise details will turn out not to matter, so long as we take the order 
of limits so that the conformally transformed “distance” to the brick wall e is larger than any 
other scale in the problem. This is valid if we take the brick wall radius e to be parametrically 
small compared to the UV cutoff of the theory which cuts off the contributions from large 
transverse momentum m. Hence we wish to analyse the theory on a cylinder of length 
ln{R/e) and periodicity (d. If we abuse dimensional analysis by assuming Ini?, Inm ~ 1, we 
may write the length as ln(e“^). 

The important thing to notice is that the theory on the cylinder is massless, but not 
periodic. Hence there is an IR divergence in the theory, which manifests as the absence of 
a mass gap on the cylinder when evolving along the p direction. 

The modes with nonzero r-momentum are gapped, since they correspond to harmonic 
oscillators. Any excitiations of these modes due to the boundary condition decay rapidly 
away from the entangling surface, so that their contribution is purely local. This simply 
shifts the nonuniversal power law contributions to the entropy. 

But the zero modes (j) = 4>o{p), which are constant in the r direction, are not gapped. 
This system corresponds to a free particle whose “position” x = 0o/\/^ is the canonically 
normalized zero mode of the held. Under p-evolution, the wavefunction of the particle 
spreads out as a Gaussian. If we Fourier transform to the momentum p (which is continuous 
since (pQ of the scalar held is not periodic), the wavefunction evolves from one end of the 
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cylider to the other like 


d'(Pfinal) = (48) 

where rj = ln(e“^)//5 is the length of the cylinder measured in units of its width. 

For Neumann boundary conditions, the wave function is initially a. p = 0 eigenstate 
i’Nip) = and so it is invariant under the radial evolution. On the other hand, the 

Dirichlet wave function is initially an x = 0 eigenstate = (2vr)“^/^, and under radial 

evolution evolves to 


(e ^Pd){p) = (27r' 


-l/2g-4r,p2 


e^-O 


V 




(49) 


After radial evolution, the Dirichlet wave function approaches the Neumann one times a an 
extra factor of 77 “^/^ (irrespective of the mass m of the held, which makes no difference in 
the limit e —)■ 0 ). 

It is interesting to note that because the log which appears in the geometric entropy 
formula ([T]) stacks onto the log inside p coming from the conformal transformation to the 
cylinder, the entropy of the 2D Dirichlet massive scalar on an interval with two endpoints 
ends up having a surprising divergence structure: 

S'=lne“^ — Inlne”^ + hnite, (50) 

o 


where the hrst term is the normal log divergence, which in this calculation comes from the 
Casmir energy of the cylinder vacuum, and the second term is a log of a log. We plan to 
address the signihcance of this term for scalar helds in another paper j^. For Maxwell 
helds with our choice of boundary conditions, this peculiar term will end up cancelling with 
another term coming from edge modes fsection lyT]) . 

We conclude that imposing Dirichlet boundary conditions leads to an extra factor of 
for each of the modes of except the zero mode. Since there are y points on the entangling 
surface of B, this factor comes in y times and thus we may write 


Zd/n — 


det ' 


ln(e ^) 


^0 


det' (A^) 


-x/2 


(51) 


Note that, because zeta function regularization is not invariant under multiplying functional 
determinants term-by-term, we may not cancel the factors of in the determinants in the 
naive way. We may however rescale the determinants by shifting each term (including the 
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zero mode of by a constant. Up to an unimportant local anomaly term (which only 
affects scheme-dependent quantities), we obtain 


Zd/n 




(52) 


This correction will be important for establishing the equivalence (1451) between the entropy 
calculations with and without the brick wall. 


VI. ENTANGLEMENT ENTROPY AND EDGE MODES 


32j, and then take 


To interpret the contact term as an entanglement entropy, we return to the question 
of how to dehne entanglement entropy in a gauge theory. We hrst recall the dehnition of 
entanglement entropy for Hamiltonian lattice gauge theories used in Ref. 
its continuum limit. 

In the Hamiltonian formulation of U{1) lattice gauge theory, a convenient basis for the 
Hilbert space is the electric field basis. Each vector in this basis is labelled by a quantized 
electric flux Ey^ G qZ assigned to each oriented link {v,w) of the lattice. These must obey 
Gauss’ law, that the electric flux at each vertex is zero, Yhy^vw = 0. The Hilbert space is 
spanned by superpositions of these electric states. 

To dehne the entanglement entropy on this lattice, we partition the vertices into two sets 
A and B. The entangling surface then intersects the lattice in some set of edges. We dehne 
the Hilbert space I-La of region A to be spanned by the electric held states that include all 
edges that intersect the region, including the boundary. The physical states, those satisfying 
Gauss’ law, can be identihed as states in I-La <S)'Hb, but they do not span the full tensor 
product. The Hilbert space Ba G) "Hb includes states for which the electric hux on the 
boundary of A does not match with the electric hux on the boundary of B. Although the 
full Hilbert space "H does not admit a local factorization, we can embed the physics states 
into the tensor product I-La <8 "Hb and then dehne entanglement entropy normally. 

For gauge-invariant states, the reduced density matrix of a region commutes with the 
operator measuring electric hux through any link on the boundary. As a result, the den¬ 
sity matrix can be split into a direct sum of superselection sectors, each labelled by the 
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configuration of E± on the entangling surface: 


P = 

Ei_ 

The resulting entropy is given by 


(53) 


S = -5^p(Bi)hip(Bx) + 5Zp(B±)S(p(Bi)). (54) 

F!_\_ F!_\_ 

We will now define the entanglement entropy as the continuum limit of this expression. 

For each distribution of surface charges E±, we define the edge mode to be the unique 
static classical solution of the form E = Vip with the boundary condition V_L(p = E±. Any 
field configuration can be expressed as a sum of an edge mode and a fluctuation satisfying 
the magnetic conducting boundary condition E± = 0. Since Maxwell theory is linear, the 
action for such a configuration is the sum of the on-shell action of the edge mode and the 
action of the fluctuation. The term S{p{E±)) in fl5T|) is therefore independent of E±, so it 
is simply the entropy of the theory with magnetic conducting boundary conditions given by 

an. 

The first term in (l5^ is the entanglement entropy of the edge modes, and the second is 
the bulk entropy, so we have 

S »S'edge T >Sbulk- 


To calculate Fedge we define an edge mode partition function by integrating over the edge 
modes weighted by their on-shell action: 


y _ 

•^edge 


(56) 


The measure in this expression is taken to be the the continuum limit of the sum in fl5H) . 
We can obtain the entropy of the edge modes S'edge from the geometric entropy formula ([T]). 

In order to make sense of the formal expression fISBD we will need to introduce a short- 
distance cutoff near the entangling surface. We will do this by introducing a boundary at 
r = e, similar to the brick wall model of Ref. except that rather than fixing boundary 
conditions we sum over all values of the perpendicular electric held E^. 

In order to compare the entanglement entropy defined in (l5^ with the geometric entropy, 
it is sufficient to compare the partition functions at arbitrary /9. Previously we saw that the 
geometric partition function (with no brick wall) can be re-expressed in the form 


Z 


bulkedge* 


(57) 
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so long as the following identity is true: 


ZedgeZD/N — Z^Z^ (58) 

We now show that these two partition functions are in fact equal, so that the contact term 
does indeed have a statistical interpretation, as it contributes to the entropy of the edge 
modes. 

In IVI Al we calculate the Euclidean action for each edge mode appearing in Eq. fl56|) . and 
show that the partition function is that of a negative scalar held on the entangling surface. 
In IVIBI we compute the measure appearing in Eq. by taking the continuum limit of 
a lattice regulator. This gives the appropriate factors of q and Vol(J^) that appear in the 
contact term (l38|) . as well as the term Zd/n in (l58|) . 

A. Short-distance expansion 

In our product geometry Ai = B x J^, the entangling surface consists of some number 
of points in B each with a conical singularity of angle (3. Let us begin by choosing one of 
these points around which we will hx polar Riemann normal coordinates with the point at 
the origin r = 0. In these coordinates the metric takes the form 

ds^ = + NirYdr^, (59) 

where r is /3-periodic and the lapse function is iV(r) = r + 0{r‘^). We place a brick wall at 
r = e, and solve for the classical solution with hxed E_[_ on the brick wall. 

The solutions have qualitatively different behavior for the mode of E± constant along 
E and for the higher modes. For the constant mode the solutions have a constant electric 
held throughout B whose value is quantized in multiples of the fundamental charge This 
is precisely the contribution to the partition function Ze coming from the constant 
electric helds. We will separate out this contribution and focus on the nonzero modes in 
what follows. 

For the nonzero modes the classical solutions decay rapidly away from the entangling 
surface. This is because conhgurations with electric held lines extending far from the surface 
have a large boost energy. Because these solutions closely hug the entangling surface they 
have a small action associated with them - and they make a large contribution to the 
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partition function that is local to the entangling surface. Since the solutions contributing to 
the sum over edge modes do not extend far from the entangling surface, it will be sufficient 
to treat each connected component of the entangling surface independently. 

For each distribution of surface charge E^{x), we have to hnd the action of the corre¬ 
sponding classical solution. Let us expand the vector potential in modes 'ipn{x) of the hber 
E as 

A = An(pn{r)i>n{x)dT, (60) 

n 

We will take rjjn to be eigenfunctions of the scalar Laplacian on the hber 


normalized so that Jjr = 1- 

In order for A to describe a classical solution, we must have die F = 0: 

F = {dr^n)'4’n dr f\ dx + (fnd'ipn A dr (61) 

-kF = N~^ [{dr(Pn)'lpnyo\jr + iejr d^j^n A dv] (62) 

die F = {drN~^dr(pn) 'ipn dr A Volj- -|- N~^(pn {d iejr dipn) A dr (63) 

= {drN~^dr^n “ N~^ Xn^n)i^ndr A Volj-. (64) 

Thus the classical equation of motion reduces to 

NdrN~^dr^n = K^n- (65) 


The on-shell action of this solution can be obtained by using the equation of motion to 
integrate by parts. This is analogous to the way in which the electrostatic energy of a system 
of charges can be expressed in terms of the potential evaluated at the charges. The on-shell 
action is given by 

5 *=^ J F A ieF j dA AieF = ^ ^ A A ieF. ( 66 ) 

This depends only on the held values at the brick wall, so we need only hnd the asymptotic 
expansion of the solution to (1651) for r —)■ 0. 

For small r, the solutions of fl65l) have the leading-order asymptotic behavior 

ipn{r) = a + br^\nr + 0{r^). (67) 

For such a solution, the equation of motion fl65l) relates the coefficients of the asymptotic 
expansion as 

2b = Xntt. (68) 
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This allows us to relate the potential at the brick wall to the normal electric held. Let 
us expand the perpendicular electric held on the boundary as Ej^{x) = The 

electric held at the brick wall is 


En = —N ^dr(pn = 261n(e 


(69) 


and the value of the potential at the surface is determined up to terms of order by En as 

, 2b En . . 

(70) 


r=0 ~ ^ ~ \ ~ \ 1 

An Anln(e \ 


Inserting the mode expansion into the on-shell action fl66l) we hnd that 

I{E±) = ^ / (PnN~^{dripn) drA'0^Volj' = ^ / ipnN~^{dripn)dT = ^ 

^ „ JdBxT ^ „ JdB ^ „ J 


From the asymptotic expansion 0671) . we hnd that to leading order in e, 

fSEl 


HEi) = 


(71) 


( 72 ) 


2Anln(e-^) 

The eigenvalue A„ appears in the denominator; thus the integral over E± will lead to a 
functional determinant det(A)’''^/^, like the partition function of a negative scalar. 

Note that the argument e~^ of the logarithm in fl72l) is dimensionful, and needs to be 
compensated by another dimensionful factor. This dimensionful factor is determined either 
by A„, which is related to the transverse wavenumber of the perturbation, or by the length 
scale R associated to the background. This is the same behavior as in the calculation of 
Zd/n in section |Vl In the case where R is large we can appeal to the Rindler limit in which 
N{r) = r. Then the solution to Eq. fj65ll is 

( 73 ) 

In this case the dimensionful factor of e in the action is compensated by A„. In de Sitter 
space of radius R, one can show that 

A„r2 


(Pn{r) = riFi(0^r) ~ 1 -h ^^^ln(VA^r) C>(r^). 




1 + 


ln(r/i?) -|- 0{r^ 


( 74 ) 


where the exact solution can be written in terms of hypergeometric functions. Which dimen¬ 
sionful constant compensates for the dimensions of e depends on whether the given mode 
is larger or smaller than the length scale of B. These logarithms appearing in the action 
lead to In In R terms in the entanglement entropy, similar to those that appeared for massive 
scalars in Ref. These types of terms will be analyzed in more detail in a future work 


47|. 


4t. 
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B. Functional measure 


We have established that the edge modes take the form of a negative scalar determinant 
on the entangling surface, which agrees with the contact term up to a constant prefactor. In 
order to match this prefactor, we have to carefully define the path integral measure VE±. 
We do this by taking the continuum limit of the discrete measure on the lattice. Consider 
a discrete set of N boundary points {xi} in E, representing the points at which the links of 
the lattice pierce the entangling surface. We imagine the surface is tesselated so that each 
of these points is assigned a volume Vol(J^)/iV. In the lattice theory, the integrated electric 
flux through each point is quantized in units of q. The sum over these discrete values dehnes 
the lattice measure 

* ^ * '' 

The Kronecker 6 ensures that we sum only over conhgurations such that the total flux 
through the entangling surface vanishes; this is because we have already accounted for 
solutions for which the flux through the entangling surface is constant as part of Z^- 

In order to carry out the integration, we can change variables from E±{xi) to the coeffi¬ 
cients En of a mode expansion via 


E±{x) = y^^En-il^njx). 


(76) 


To take the continuum limit we make the replacement 

Vol(J') f 


E - 


qN 


dE±{x). 


(77) 


Expressed in terms of the mode expansion, the measure fl75]) (without the delta function) is 

(78) 


VE,^Yl-dYm<tE.. 


N 


This can be seen by comparing the calculation of J VE± exp (— y/^Ej_{x)) in the mode 

expansion and in the continuum limit of 

We now have to restrict to those field configurations such that the total electric flux 
through the surface vanishes. This can be obtained by inserting the ^-function; 


(5 



= (5 


VvolOT 



(79) 










where Eq is the coefficient of the constant zero mode. Performing the zero mode integral 
with the help of Eq. fl79l) we are left with the path integral measure 


VE^ = 


1 

^/N 



Vol(J^) 

N 


dEn- 


(80) 


Now we can carry out the path integral using the measure (|80|) and the action (I72)), with 
the result 


V _ 

•^edge 


n 




1 /Vol(.F) 


N 


27 rln(e 


1/2 


Zs ^ .^ 27 rVol(^)ln(e 

det 


-I'l 


Vn V 


g2iV/3 


^0 


1/2 


(81) 

including the factor Ze coming from the quantized constant edge mode. We can now rescale 
this formula using the zeta function identity flT^ : det'(aA) = a'^^^’^Met'(A). Up to an 
anomaly term, (C(Aj^, 0) = — dimker(Aj^) = —1. After rescaling, and taking into account 
that the entangling surface consists of xi^) components, the edge mode partition function 
is 


-^edge 




dn(e-i) V27rVol(J^) 

~q 


-X{B) 


det (A 




(82) 


We see that the partition function of the edge states agrees with the result from the contact 
term (EHD, up to the factors Ze (|36l) and Z^/n = (lii(e“^)//d)^/^ (152|) . which appear exactly 
as needed fl58|) to make the geometric and entanglement entropies agree. Thus we have 
provided the geometric entropy with a manifestly statistical interpretation. 


VII. LOGARITHMIC DIVERGENCE IN FOUR DIMENSIONS 


The case of four dimensions is special, because four-dimensional Maxwell theory is con¬ 
formal. In a conformal field theory, the entanglement entropy of a spherical region has a 
universal logarithmic divergence, whose coefficient is related to the trace anomaly 421. l48|. 

Four dimensions is also interesting because of the connection to AdS/CFT. In a conformal 
theory with a holographic dual described by Einstein gravity, the Ryu-Takayanagi (RT) 
formula relates the entanglement entropy to the area of a minimal surface that extends 


into the buf 


bj]. The logarithmic term in this entro py is related to the holographic trace 


anomaly which is protected by supersymmetry 


49( 1 . Thus, although the RT formula 


becomes tractable only at strong coupling, there is a universal part that should agree at 
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strong and weak coupling. Thus the entanglement entropy gives us a possible check on the 
RT formula that can be carried out at weak coupling. 

While the RT formula has passed many nontrivial checks given the subtlety of 
entanglement entropy in gauge theories, there is still the question of which entropy RT 
calculates, given that there are multiple possible candidates for the role of “entanglement 


entropy” 33|. The derivations of the RT formula 43 


51 


(or the related replica trick 


52| involve the geometric entropy 


53| ) and therefore one expects that RT calculates the geometric 


entropy. We shall show below that at weak coupling, for Maxwell helds, this geometric 
entropy includes a contribution from edge modes, as needed to agree with the trace anomaly. 
Since RT also agrees with the trace anomaly, we expect that the RT entanglement entropy 
of strongly coupled Yang-Mills theory already includes an edge mode contribution, but we 
will only perform the edge mode calculation at weak coupling. 

Consider the entanglement entropy of a ball of radius r in 3 -|- 1 dimensions. Using 
conformal symmetry, the entanglement entropy can be equivalently expressed as the thermal 
entropy in hyperbolic space M x or as de Sitter entropy of the static patch of de Sitter 
space. The logarithmic divergence in the entanglement entropy is related to the a-type trace 
anomaly 43|, 


m = 




/0( 

^abcd^ 


abed 


(83) 


where E 4 = RabcdR°‘^'^'^ — ‘^RabR^^^ + is the 4-dimensional Euler density, whose integral is 


/ y/g E 4 ^ = 327r^x, and Cabed is the Weyl tensor. The logarithmic divergence in the entropy 
is given by: 

Ranom ~ 4aln(e/r), (84) 

where ~ denotes agreement of logarithmic divergences, and e is an ultraviolet cutoff length. 
For a theory of no scalars, ni /2 Dirac fermions and ni gauge helds the trace anomaly predicts 


a geometric entropy 


Ml 


Ra 




(85) 


Ref. 4^ calculated the entanglement entropy of a free theory of no scalars, ni /2 Dirac 
fermions and ni gauge helds. The logarithmic divergence in the entropy was found to be: 




thermal 


^ (^0 + Y^i /2 + 32ni^ ln(e/r) 


( 86 ) 
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This result was obtained by making a conformal transformation to de Sitter space, calculat¬ 
ing the thermal entropy as a function of temperature, and then integrating the first law of 
thermodynamics. Comparing (18 5 p with (186 p . we see that the thermal entropy of the scalar 
and spinor fields agree with the corresponding trace anomalies, but for the gauge fields the 
two differ. Since the coefficients of log divergences are expected to be universal (i.e. inde¬ 
pendent of the regulator scheme), this discrepency cannot be attributed simply to the choice 
of regulator. 


However, the discrepency can 
When integrating the first law. 


De resolved by including the entanglement of edge modes. 


44j | assumed that the entropy vanishes in the zero tem¬ 


perature limit. That would have been the case if we had kept the hnite lattice spacing 
(introduced temporarily in section IVTp . but it is not true for the edge modes in the contin¬ 
uum, even at hnite brick wall radius, because there are a continuum of edge modes at any 
nonzero temperature. There is a contribution which is independent of /?, and therefore does 
not vanish in the (3 —?■ -|-cxd limit. This divergent contribution to the entropy is missed by 
the thermodynamic calculation of Ref. 441. 


The result (l86ll can also be found following [2^, l39| by calculating the thermal entropy 
density on and multiplying by the regularized volume. However such a procedure misses 
any non-extensive contributions coming from boundary effects, such as the edge mode con¬ 
tribution of section lYD 

We now calculate the entropy of the edge modes on hyperbolic space and show that when 
added to the thermal result (1861) . the result is in agreement with the trace anomaly (]85p . The 
argument proceeds almost identically to section I VII except that the manifold is no longer a 
product manifold. Instead we use conformal symmetry to map the entanglement entropy of 
a sphere to the thermal entropy on the static universe x for which the metric is 


ds^ = -I- du^ -|- smh{uYdQl. 


(87) 


Under this transformation the entangling surface is mapped to u = oo, so that the brick 
wall at r = e is mapped to 

Mmax = -ln7^. (88) 

2 r 

To find the edge modes we £x the electric flux E± at u = Umax, and solve for the potential 
ip in the interior: 

VV = 0, duP |n=n,na. = E±. (89) 
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We can expand E±_ and in spherical harmonics: 

Ede,ct>)= Y, E,mYem{9A), 

£>0,m 

^p{u,9,(p)= Y ^e,m{u)Yem{9,(l)). 

£>0,m 


Equation flH^ then becomes 

d^(f£,m(u) + 2 tanh(u)duip£,m(u) - smh(u)~^£(£ + 
whose solution is 


0 . 


(Pi,m{u) oc tanh(M)^ 2-^1 (|, ^,tanh(M)^) 

where 2 Ei{a,b;c; z) is the hypergeometric function. 

To hnd the action, we can use the asymptotic expansion of cp, given by 

ipi^rn OC 1 — 2 £{£ + + C>(e““) 


(90) 

(91) 


(92) 


(93) 


(94) 


This gives the value of 99 at m = m 


to leading order in e, 
e^lne-^ ^ 


The Euclidean action is then given by 




Ine-i 

£{£+l)Y 


jp 2 


(95) 


(96) 


Since we are interested in the logarithmic divergence, we can ignore the independent factors 
in the action and the measure. 

We recognize the factor £{£ + 1) in fl96|) as the spectrum of the Laplacian on the sphere. 
Since it appears in the denominator, it leads to an edge mode partition function equal to that 
of a wrong-sign scalar held, exactly as in I VI I The logarithmic divergence of this partition 
function is given by the well-known trace anomaly of a two-dimensional scalar held. 


m = 


R 

24^' 


(97) 


This determines the logarithmic divergence in the partition function of the edge modes, and 
hence the entropy 




edge 


InZ 


edge 


gin).). 


(98) 


Thus the thermal entropy (l 86 D should be supplemented by an edge mode contribution for 
each of the rii gauge helds. This result agrees with the result of the trace anomaly ([85]). 
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VIII. DISCUSSION 


We have shown that in Maxwell theory, the geometric entropy calculated by Euclidean 
path integral methods agrees with the entanglement entropy using a brick wall regulator 
plus edge modes. Since the latter system manifestly has a statistical interpretation, the 
former does as well. The contact interaction, which in the Euclidean path integral comes 
from nonminimal coupling to curvature, is equivalent to the entanglement entropy of the 
edge modes (up to the correction described in section |V]). This resolves a longstanding puzzle 
about the interpretation of the contact term. 

To leading order, the contact term takes the form of a negative scalar held conhned to 
the entangling surface. The contact term affects the universal subleading divergences in the 
entanglement entropy, which are the coefficient of the logarithmic divergence in even dimen¬ 
sions and the constant in odd dimensions. Since the entangling surface has codimension two, 
the edge modes have subleading divergences of the same order, and hence contribute to these 
universal terms. In section lyTTl we examined these divergences in the case of 3-1-1 dimensions. 


44j | between the entanglement entropy 


resolving an apparent discrepancy hrst noted in Ref. 
and the trace anomaly. 

In sections Hm through IVl we have assumed a special class of geometries of the form BxJ^. 
The main role of this assumption was to be able to isolate the contact term contribution 
to the partition function by expressing all the local degrees of freedom as scalar helds. In 
more general geometries, it is not clear whether such a division is possible. However, the 
division of the entropy into an edge mode contribution and a bulk contribution fl5T|) is more 
general and does not depend on the background geometry. It is straightforward to generalize 
the edge mode calculation to geometries with a rotational symmetry around the entangling 
surface, as was done in section IVIII in the special case of hyperbolic space. This symmetry 
allowed us to use the geometric entropy formula ([I]). To calculate entanglement entropy for 
more general regions by Euclidean methods one should instead use the replica trick 531. 
In this more general setting, the entangling surface may have nonzero extrinsic curvature 
that must be taken into account in the short distance expansion near the entangling surface 
fl59|) . This would allow us to determine the dependence of the entanglement entropy on the 
extrinsic geometry of the entangling surface, to compare to l55 |. 

We used a lattice regulator to dehne the measure of the path integral over edge modes. 
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and we expect that we would obtain similar results if we had calculated the entropy entirely 
on the lattice (perhaps using the replica trick). The leading order area law divergence of 
the von Neumann entropy on the lattice should be manifestly positive, but we expect that 
the log divergences should agree with the continuum result. 

In this article we have only considered the case of abelian gauge helds; it would be 
interesting to extend the results to the nonabelian case. For some purposes SU(A^) gauge 
helds can be treated in the weakly coupled limit as if it were — 1 decoupled copies of the 
U(l) abelian theory, modulo zero mode issues related to the different shape and topology 
of the gauge group. Thus one would expect at weak coupling the entropy should grow like 
7V^. If, following Ref. 33|], we allow only entanglement between gauge invariant operators 


this seems unlikely to be the case, since the number of gauge-invariant operators grows 
more slowly than the number of local degrees of freedom. In the dehnition of entanglement 
entropy proposed for lattice gauge theories in Ref. 32| there is an additional term in the 
entropy that grows with the dimension of the gauge group, and which might correct for this 
dehciency. It remains to be seen whether either of these dehnitions of the entropy has the 
anticipated quadratic scaling with N. 

A comment is warranted about the negative sign in the exponent of the contact term in 
the heat kernel; this is the phenomenon that led to the conclusion in Ref. 8| that the Maxwell 
held has negative entropy. It seems surprising that one could arrive at a negative entropy 
starting from a manifestly positive lattice expression. The negative result arises in taking 
the continuum limit of this discrete expression. To see how such a term arises, consider 
taking the continuum limit of the entropy S = —^plnp over a discrete set of values with 
spacing a. Now consider the limit a ^ 0, with the probability density p = p/a held hxed, 
so that S = —^p\np^ — J plnp — Ina. The former expression is the continuous entropy, 
which becomes negative for tightly spaced distributions, but this negative term is always 
compensated by the positive divergent piece — In a. However when we sum over modes the 
continuous entropy becomes increasingly negative for higher modes as the action increases 
with the value of the electric held. The — In a term, on the other hand, is the same for each 
mode and can be absorbed into a rescaling of the helds or as a local counterterm. Thus one 
arrives in the continuum at an expression that appears formally negative. 

Zeta function regularization eliminates all power law divergences, but one can retain 
the power law divergences by instead inserting a UV cutoh e directly into the heat kernel 


34 






expression for the partition function of a scalar field, as done by Kabat 8|: 

tr e“ 


InZ = - 


I roo 4-^ ^-sA 


ds = ^ Indet A, 


(99) 


where the 


represent power-law divergences 


56[ | (including a constant in even dimen¬ 


sions). Note that even a single mode produces a lne“^ divergence when inserted into this 
expression. This occurs because the path integral measure for each scalar mode requires a 
dimensionful factor p to be inserted for it to be well dehned. In the heat kernel regulator, the 
convention is that we choose n = e“^, but as a consequence even a single mode appears to be 
UV divergent. Because of this convention, one hnds that the leading order cosmological con¬ 
stant divergence of the scalar held is positive. If we had instead imposed a hard momentum 
cutoff A, while leaving p hxed in the A —)■ cx) limit, the leading order cosmological constant 
becomes negative. Since the edge mode contribution is like minus one scalar held, a hard 
cutoh on the edge modes would produce a positive leading contribution to the entropy. This 
illustrates the dangers in assigning too much signihcance to the divergent terms generated 
by the heat kernel method. 

Thus we see that the negativity of the Kabat contact term is an artifact of the particular 
regulator scheme he used. However, the ambiguity of sign is only possible in the hrst place 
because the edge modes are continuous and need a path integral measure VEx^ to be well 
dehned. 

A similar negative contact term appears in the heat kernel for gravitons 4,l9|]. Presumably 
the graviton contact term can also be understood in terms of edge modes due to gravitational 
constraints. However, there are many additional conceptual problems involved in dehning 
the entanglement entropy of graviton helds (reviewed in Appendix A of j^h 

The entanglement entropy of quantum helds on a black hole background provides an 
additive contribution to the total entropy (references in ^), and may conceivably explain 
it all, when integrated up to the Planck scale 58|, 1^. In the limit of a large black hole, we 
may write the entropy as 


Sbh — 


A 


A 


AS, 


matter ? 


( 100 ) 


4G 4G(A) 

where G(A) is the ehective value of the Newton’s constant at the energy scale A. If S'matter 
has a positive divergence, this leads to screening of G. 

On the other hand, if gravitons and gauge helds provide a negative divergence in 5'matter, 
this leads to antiscreening of Newton’s constant G. This suggests that G may how to a 
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JV fixed point at positive values of G. This is the asymptotic safety scenario proposed in 




61| (see 62|, for reviews). However, the RG flow of G, being a power law divergence 


in R) > 2, is a scheme-dependent quantity, raising questions about whether this provides 


meaningful evidence about the consistency of the theory in the deep UV 


64| . In a different 


scheme where the entropy is inherently positive, such as the lattice regulator, any UV hxed 
point would need to be at negative values of G. 

Moreover, a negative ^matter would be in some tension with the expected positivity of 
entropy from the statistical mechanics of black holes. It is expected that the black hole en¬ 
tropy A/AG has a microscopic state counting interpretation in terms of a true von Neumann 
entropy —tr(pln p) of a discrete number of states. For this to be true, the theory of quantum 
gravity must somehow discretize the continuous E_[_ degrees of freedom, much as the lattice 
regulator does. If this is the right way to think about black hole entropy, it would seem to 
follow that S'edge should be positive in the fundamental theory of quantum gravity. 

The contact terms also appear to play a role in making the perturbative one-loop string 
contribution to the entanglement finite, cutting it off at the string length scale 0 ,Q,Q- 


Here the negative and positive divergences exactly cancel. But since a negative divergent 
contribution to the black hole entropy makes little sense for the reasons described above, 
presumably this description is valid only at distances larger than the Planck scale. We 
expect that in a nonperturbative definition of string theory, the black hole entropy will be 
manifestly positive. 
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